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SEQUENCE LISTING 

<110> REILING, KEITH KINKEAD 
NEWMAN, JACK D. 
WITHERS III, SYDNOR T. * 
PITERA, DOUGLAS J. 
KEASLING, JAY D. 
MARTIN, VINCENT J.J. 

<120> METHODS FOR IDENTIFYING A BIOSYNTHETIC PATHWAY GENE PRODUCT 

<130> BERK-032WO 

<140> Unassigned 
<141> 2004-09-29 

<150> 60/507,220 
<151> 2003-09-29 

<160> 37 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 1 

gatctgcagt aggaggaatt aaccatgcat taccgttctt aact 44 

<210> 2 
<211> 33 
<212> DNA 

<213> Artificial Sequence 

<220> • * 
<223> primer 

<400> 2 

ttgatctgcc tcctatgaag tccatggtaa att 33 

<210> 3 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 3 

acttcatagg aggcagatca aatgtcagag ttgagagcct tc 42 

<210> 4 
<211> 33 
<212> DNA 

<213> Artificial Sequence 



1 



wo 2005/033287 



PCT/US2004/032407 



<220> 

<223> primer 



<400> 4 

gagtattacc tcctatttat caagataagt ttc 



33 



<210> 5 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 5 

gataaatagg aggtaatact catgaccgtt tacacagcat cc 42 

<210> 6 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<210> 7 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 7 

gatgtcgact aggaggaata taaaatgaaa aattgtgtca tcgtc 45 

<210> 8 
<211> 34 • 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 8 

ttagcU gtcc tccttaattc aaccgttcaa tcac 34 

<210> 9 
<211> 46 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> primer 



<400> 6 

tacctgcagt tattcctttg gtagaccagt 



30 



<400> 9 

gatgtcgaca ggaggacagc taaatgaa^ac tctcaactaa actttg 



46 



<210> 10 
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<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 10 

agtgtaatcc tccttatttt ttaacatcgt aag 33 

<210> 11 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 11 

ttaaaaaata aggaggatta cactatggtt ttaaccaata aaacag 4 6 

<210> 12 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 12 

atcgtcgact taggatttaa tgcaggtgac ggacc 35 

<210> 13 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400>" 13 

atcccgggag gaggattact atatgcaaac ggaacacgtc 40 

<210> 14 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 14 

atcccgggtt atttaagctg ggtaaatg 28 

<210> 15 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



3 



wo 2005/033287 



PCT/US2004/032407 



<400> 15 



agatccgcgg aggaggaatg agtaatggac tttccgcagc aac 



43 



<210> 16 
<211> 81 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 16 

agtgagagct cttatttatt acgctggatg atgttgggct agcaggagga attcaccatg 60 
agttttgata ttgccaaata c 81 

<210> 17 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<210> 18 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 18 

caaaatcaag gcctggagga cataaatata tgcttcgttc gttgctcaga 50 

<210> 19 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 



<210> 20 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 20 

tgataccatg gactttccgc agcaactcg 29 

<210> 21 
<211> 32 
<212> DNA 



<400> 17 

tctgagcaac gaacgaagca tatatttatg tcctccaggc cttgattttg 



50 



<400> 19 

gcatccatgg tatcatcctc cgttgatgtg atg 



33 
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<213> Artificial Sequence 
<220> 

<223> primer 



<400> 21 

gtacatgcat ttatttatta cgctggatga tg 32 

<210> 22 
<211> 59 
<212> DMA 

<213> Artificial Sequence 



<220> 

<223> primer 



<400> 22 

gggtaccggg ccccccctcg cctctagagt cgactaggag gaatt caeca tgagttttg 59 

<210> 23 
<211> 1671 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic amorphadiene synthase gene 



<400> 23 

gattaaggca tgcaccatgg ccctgaccga agagaaaccg atccgcccga tcgctaactt 60 
cccgccgtct atctggggtg accagttcct gatctacgaa aagcaggttg agcagggtgt 120 
tgaacagatc gtaaacgacc tgaagaaaga agttcgtcag ctgctgaaag aagctctgga 180 
catcccgatg aaacacgcta acctgctgaa actgatcgac gagatccagc gtctgggtat 240 
cccgtaccac ttcgaacgcg aaatcgacca cgcactgcag tgcatctacg aaacctacgg 300 
cgacaactgg aacggcgacc gttcttctct gtggtttcgt ctgatgcgta aacagggcta 360 
ctacgttacc tgtgacgttt ttaacaacta caaggacaag aacggtgctt tcaaacagtc 420 
tctggctaac gacgttgaag gcctgctgga actgtacgaa gcgacctcca tgcgtgtacc 480 
gggtgaaatc atcctggagg acgcgctggg tttcacccgt tctcgtctgt ccattatgac 540 
taaagacgct ttctctacta acccggctct gttcaccgaa atccagcgtg ctctgaaaca 600 
gccgctgtgg aaacgtctgc cgcgtatcga agcagcacag tacattccgt tttaccagca 660 
gcaggactct cacaacaaga ccctgctgaa actggctaag ctggaattca acctgctgca 720 
gtct'ctgcac aaagaagaac tgtctcacgt ttgtaagtgg * tggaaggcat ttgacatcaa 780 
gaaaaacgcg ccgtgcctgc gtgaccgtat cgttgaatgt tacttctggg gtctgggttc 840 
tggttatgaa ccacagtact cccgtgcacg tgtgttcttc actaaagctg tagctgttat 900 
caccctgatc gatgacactt acgatgctta cggcacctac gaagaactga agatctttac 960 
tgaagctgta gaacgctggt ctatcacttg cctggacact ctgccggagt acatgaaacc 1020 
gatctacaaa ctgttcatgg atacctacac cgaaatggag gaatt cctgg caaaagaagg 1080 
ccgtaccgac ctgttcaact gcggtaaaga gtttgttaaa gaattcgtac gtaacctgat 1140 
ggttgaagct aaatgggcta acgaaggcca tatcccgact accgaagaac atgacccggt 1200 
tgttatcatc accggcggtg caaacctgct gaccaccact tgctatctgg gtatgtccga 1260 
catctttacc aaggaatctg ttgaatgggc tgtttctgca ccgccgctgt tccgttactc 1320 
cggtattctg ggtcgtcgtc tgaacgacct gatgacccac aaagcagagc aggaacgtaa 1380 
acactcttcc tcctctctgg aatcctacat gaaggaatat aacgttaacg aggagtacgc 1440 
acagactctg atctataaag aagttgaaga cgtatggaaa gacatcaacc gtgaatacct 1500 
gactactaaa aacatcccgc gcccgctgct gatggcagta atctacctgt gccagttcct 1560 
ggaagtacag tacgctggta aagataactt cactcgcatg ggcgacgaat acaaacacct 1620 
gatcaaatcc ctgctggttt acccgatgtc catctgatcc cgggattaga t 1671 



<210> 24 
<211> 1671 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> synthetic amorphadiene synthase gene 



<400> 24 

ctaattccgt 

gggcggcaga 

acttgtctag 

gtagggctac 

gggcatggtg 

gctgttgacc 

gatgcaatgg 

agaccgattg 

cccactttag 

atttctgcga 

cggcgacacc 

cgtcctgaga 

cagagacgtg 

ctttttgcgc 

accaatactt 

gtgggactag 

acttcgacat 

ctagatgttt 

ggcatggctg 

ccaacttcga 

acaatagtag 

gtagaaatgg 

gccataagac 

tgtgagaagg 

tgtctgagac 

ctgatgattt 

ccttcatgtc 

ctagtttagg 



acgtggtacc 
tagaccccac 
catttgctgg 
tttgtgcgat 
aagcttgcgc 
ttgccgctgg 
acactgcaaa 
ctgcaacttc 
taggacctcc 
aagagatgat 
tttgcagacg 
gtgttgttct 
tttcttcttg 
ggcacggacg 
ggtgtcatga 
ctactgtgaa 
cttgcgacca 
gacaagtacc 
gacaagttga 
tttacccgat 
tggccgccac 
ttccttagac 
ccagcagcag 
aggagagacc 
tagatatttc 
ttgtagggcg 
atgcgaccat 
gacgaccaaa 



gggactggct 
tggtcaagga 
acttctttct 
tggacgactt 
tttagctggt 
caagaagaga 
aattgttgat 
cggacgacct 
tgcgcgaccc 
tgggccgaga 
gcgcatagct 
gggacgactt 
acagagtgca 
cactggcata 
gggcacgtgc 
tgctacgaat 
gatagtgaac 
tatggatgtg 
cgccatttct 
tgcttccggt 
gtttggacga 
aacttacccg 
acttgctgga 
ttaggatgta 
ttcaacttct 
cgggcgacga 
ttctattgaa 
tgggctacag 



tctctttggc 
ctagatgctt 
tcaagcagtc 
tgactagctg 
gcgtgacgtc 
caccaaagca 
gttcctgttc 
tgacatgctt 
aaagtgggca 
caagtggctt 
tcgtcgtgtc 
tgaccgattc 
aacattcacc 
gcaacttaca 
acacaagaag 
gccgtggatg 
ggacctgtga 
gctttacctc 
caaacaattt 
atagggctga 
ctggtggtga 
acaaagacgt 
ctactgggtg 
cttccttata 
gcataccttt 
ctaccgtcat 
gtgagcgtac 
gtagactagg 



taggcgggct 
ttcgtccaac 
gacgactttc 
ctctaggtcg 
acgtagatgc 
gactacgcat 
ttgccacgaa 
cgctggaggt 
agagcagaca 
taggtcgcac 
atgtaaggca 
gaccttaagt 
accttccgta 
atgaagaccc 
tgatttcgac 
cttcttgact 
•gacggcgtca 
cttaaggacc 
cttaagcatg 
tggcttcttg 
acgatagacc 
ggcggcgaca 
tttcgtctcg 
ttgcaattgc 
ctgtagttgg 
tagatggaca 
ccgctgctta 
gccctaatct 



agcgattgaa 
tcgtcccaca 
ttcgagacct 
cagacccata 
tttggatgcc 
ttgtcccgat 
agtttgtcag 
acgcacatgg 
ggtaatactg 
gagactttgt 
aaatggtcgt 
tggacgacgt 
aactgtagtt 
cagacccaag 
atcgacaata 
tctagaaatg 
tgtactttgg 
gttttcttcc 
cattggacta 
tactgggcca 
catacaggct 
aggcaatgag 
tccttgcatt 
tcctcatgcg 
cacttatgga 
cggtcaagga 
tgtttgtgga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1671 



<210> 25 
<211> 1185 
<212> DNA 

<213> Escherichia coli 



<400> 25 

atgaaaaatt 

ctcgcttcca 

gcaaaaatcg 

ctggggcaaa 

ggattcacgg 

gccattcagg 

gccccctact 

tatgacgtaa 

accgccgaaa 

ctaca tt c a c 

aaagcgaatt 
acagtcaccg 
gaagaatctg 
agcggtggcg 
ttacaactgg 
gctgcacagt 
aacggcgggg 
acactattac 
ggcggcggtc 



gtgtcatcgt 
"c'ca'gcgccat 
attcacaaca 
atccggcgcg 
tcaataaagt 
caggtcaggc 
tactcgatgc 
tcctgcgcga 
acgtggctaa 
agcgtaaagc 
atgttgtcac 
caacggctga 
ctgggaacgc 
cggcgctggc 
tgccccccgc 
cggggctgca 
tccttgccgt 
ccatcgcgct 
atgccatgca 
agggaattgc 



cagtgcggta 
cgacctgggg 
cgttgatgaa 
tcaggcactg 
atgtggttcg 
gcagagcatt 
aaaagcacgc 
tggcctgatg 
agagtacgga 
ggcagccgca 
tcgaaagaaa 
agcgttaggt 
gtctggtatt 
agcaggcctt 
attgatgggt 
actggcggat 
tgggaaaaac 
cgggcatcct 
ggcacgcgat 
gatggtgatt 



cgtactgcta 
gcgacagtaa 
gtgattatgg 
ttaaaaagcg 
ggtcttaaaa 
gtggcggggg 
tctggttatc 
tgcgccaccc 
attacccgtg 
attgagtccg 
accttcgtct 
gcattgcgcc 
aacgacggtg 
acccccctgg 
atggggccag 
attgatctca 
ctgggctttg 
atcggtgcca 
aaaacgctgg 
gaacggttga 



tcggtagttt 
ttaaagccgc 
gtaacgtgtt 
ggctggcaga 
gtgtggcgct 
gtatggaaaa 
gtcttggaga 
atggttatca 
aaatgcagga 
gtgcttttac 
tcagtcaaga 
cggccttcga 
ctgccgctct 
ctcgcattaa 
tacctgccac 
ttgaggctaa 
attctgagaa 
gtggtgctcg 
ggctggcaac 
attaa 



taacggttca 
cattgaacgt 
acaagccggg 
aacggtgtgc 
tgccgcccag 
tatgagttta 
cggacaggtt 
tatggggatt 
tgaactggcg 
agccgaaatc 
cgaattcccg 
taaagcagga 
ggtgattatg 
aagttatgcc 
gcaaaaagcg 
tgaagcattt 
agtgaatgtc 
tattctggtc 
actgtgcatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1185 



<2T0> 26 
<211> 1476 
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<212> DMA 

<213> Escherichia coli 



<400> 26 

atgaaactct caactaaact ttgttggtgt ggtattaaag gaagacttag gccgcaaaag 60 
caacaacaat tacacaatac aaacttgcaa atgactgaac taaaaaaaca aaagaccgct 120 
gaacaaaaaa ccagacctca aaatgtcggt attaaaggta tccaaattta catcccaact 180 
caatgtgtca accaatctga gctagagaaa tttgatggcg tttctcaagg taaatacaca 240 
attggtctgg gccaaaccaa catgtctttt gtcaatgaca gagaagatat ctactcgatg 300 
tccctaactg ttttgtctaa gttgatcaag agttacaaca tcgacaccaa caaaattggt 360 
agattagaag tcggtactga aactctgatt gacaagtcca agtctgtcaa gtctgtcttg 420 
atgcaattgt ttggtgaaaa cactgacgtc gaaggtattg acacgcttaa tgcctgttac 480 
ggtggtacca acgcgttgtt caactctttg aactggattg aatctaacgc atgggatggt 540 
agagacgcca ttgtagtttg cggtgatatt gccatctacg ataagggtgc cgcaagacca 600 
accggtggtg ccggtactgt tgctatgtgg atcggtcctg atgctccaat tgtatttgac 660 
tctgtaagag cttcttacat ggaacacgcc tacgattttt acaagccaga tttcaccagc 720 
gaatatcctt acgtcgatgg tcatttttca ttaacttgtt acgtcaaggc tcttgatcaa 780 
gtttacaaga gttattccaa gaaggctatt tctaaagggt tggttagcga tcccgctggt 840 
tcggatgctt tgaacgtttt gaaatatttc gactacaacg ttttccatgt tccaacctgt 900 
aaattggtca caaaatcata cggtagatta ctatataacg atttcagagc caatcctcaa 960 
ttgttcccag aagttgacgc cgaattagct actcgcgatt atgacgaatc tttaaccgat 1020 
aagaacattg aaaaaacttt tgttaatgtt gctaagccat tccacaaaga gagagttgcc 1080 
caatctttga ttgttccaac aaacacaggt aacatgtaca ccgcatctgt ttatgccgcc 1140 
tttgcatctc tattaaacta tgttggatct gacgacttac aaggcaagcg tgttggttta 1200 
ttttcttacg gttccggttt agctgcatct ctatattctt gcaaaattgt tggtgacgtc 1260 
caacatatta tcaaggaatt agatattact aacaaattag ccaagagaat caccgaaact 1320 
ccaaaggatt acgaagctgc catcgaattg agagaaaatg cccatttgaa gaagaacttc 1380 
aaacctcaag gttccattga gcatttgcaa agtggtgttt actacttgac caacatcgat 1440 
gacaaattta gaagatctta cgatgttaaa aaataa 1476 



<210> 27 
<211> 1509 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Truncated HMG-CoA reductase nucleotide sequence 



<400> 27 

atggttttaa ccaataaaac agtcatttct ggatcgaaag tcaaaagttt atcatctgcg 60 
caatcgagct' catcaggacc ^tcatcatct agtgaggaag atgattcccg cgatattgaa 120' 
agcttggata agaaaatacg tcctttagaa gaattagaag cattattaag tagtggaaat 180 
acaaaacaat tgaagaacaa agaggtcgct gccttggtta ttcacggtaa gttacctttg 240 
tacgctttgg agaaaaaatt aggtgatact acgagagcgg ttgcggtacg taggaaggct 300 
ctttcaattt tggcagaagc tcctgtatta gcatctgatc gtttaccata taaaaattat 360 
gactacgacc gcgtatttgg cgcttgttgt gaaaatgtta taggttacat gcctttgccc 420 
gttggtgtta taggcccctt ggttatcgat ggtacatctt atcatatacc aatggcaact 480 
acagagggtt gtttggtagc ttctgccatg cgtggctgta aggcaatcaa tgctggcggt 540 
ggtgcaacaa ctgttttaac taaggatggt atgacaagag gcccagtagt ccgtttccca 600 
actttgaaaa gatctggtgc ctgtaagata tggttagact cagaagaggg acaaaacgca 660 
attaaaaaag cttttaactc tacatcaaga tttgcacgtc tgcaacatat tcaaacttgt 720 
ctagcaggag atttactctt catgagattt agaacaacta ctggtgacgc aatgggtatg 780 
aatatgattt ctaaaggtgt cgaatactca ttaaagcaaa tggtagaaga gtatggctgg 840 
gaagatatgg aggttgtctc cgtttctggt aactactgta ccgacaaaaa accagctgcc 900 
atcaactgga tcgaaggtcg tggtaagagt gtcgtcgcag aagctactat tcctggtgat 960 
gttgtcagaa aagtgttaaa aagtgatgtt tccgcattgg ttgagttgaa cattgctaag 1020 
aatttggttg gatctgcaat ggctgggtct gttggtggat ttaacgcaca tgcagctaat 1080 
ttagtgacag ctgttttctt ggcattagga caagatcctg cacaaaatgt tgaaagttcc 1140 
aactgtataa cattgatgaa agaagtggac ggtgatttga gaatttccgt atccatgcca 1200 
tccatcgaag taggtaccat cggtggtggt actgttctag aaccacaagg tgccatgttg 1260 
gacttattag gtgtaagagg cccgcatgct accgctcctg gtaccaacgc acgtcaatta 1320 
gcaagaatag ttgcctgtgc cgtcttggca ggtgaattat ccttatgtgc tgccctagca 1380 
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gccggccatt tggttcaaag tcatatgacc cacaacagga aacctgctga accaacaaaa 14 40 
cctaacaatt tggacgccac tgatataaat cgtttgaaag atgggtccgt cacctgcatt 1500 
aaatcctaa 1509 



<210> 28 
<211> 1332 
<212> DNA 

<213> Saccharomyces cerevisiae 



<400> 28 

atgtcattac 

gctgtgtaca 

ataagcgagt 

cataagtggt 

ttggccaagg 

ccgttgttag 

atgtttgttt 

cccatcggtg 

gcctacttgg 

catatagtga 

atagataacg 

ggaacaataa 

ctaacctata 

gtcaccgaga 

ctacaaggct 

gtagaaacta 

ctgcttgtct 

gatttgagaa 

ttgttacgaa 

gattttagtt 

gcaaaaaatt 

aaaactacca 

tggacttcat 



cgttcttaac 
acaagcctgc 
catctgcacc 
ccatcaatga 
ctcaacaagc 
ctcaactatc 
gcctatgccc 
ctgggttggg 
gggggttaat 
atcaatgggc 
ctgtggccac 
acacaaacaa 
ctagaattcc 
aatttcctga 
tagagatcat 
ataatgaact 
caatcggtgt 
ttggctccac 
gagacattac 
acgagacatt 
tgaataaaga 
caaagcaaca 
ag 



ttctgcaccg 
cgtcgctgct 
agatactatt 
tttcaatgcc 
caccgatggc 
cgaatccttc 
ccatgccaag 
ctcaagcgcc 
aggatctaat 
cttcataggt 
ttatggtaat 
ttttaagttc 
aaggtctaca 
agttatgaag 
gactaagtta 
gtatgaacaa 
ttctcatcct 
aaaacttacc 
tcaagagcaa 
tgaaacagac 
tcttaaaatc 
aattgacgat 



ggaaaggtta 
agtgtgtctg 
gaattggact 
atcaccgagg 
ttgtctcagg 
cactaccatg 
aatattaagt 
tctatttctg 
gacttggaaa 
gaaaagtgta 
gccctgctat 
ttagatgatt 
aaagatcttg 
ccaattctag 
agtaaatgta 
ctattggaat 
ggattagaac 
ggtgctggtg 
attgacagct 
ttgggtggga 
aaatccctag 
ctattattgc 



ttatttttgg 
cgttgagaac 
tcccggacat 
atcaagtaaa 
aactcgttag 
cagcgttttg 
tttctttaaa 
tatcactggc 
agctgtcaga 
ttcacggtac 
ttgaaaaaga 
tcccagccat 
ttgctcgcgt 
atgccatggg 
aaggcaccga 
tgataagaat 
ttattaaaaa 
gcggcggttg 
tcaaaaagaa 
ctggctgctg 
tattccaatt 
caggaaacac 



tgaacactct 
ctacctgcta 
tagctttaat 
ctcccaaaaa 
tcttttggat 
tttcctgtat 
gtctacttta 
cttagctatg 
aaacgataag 
cccttcagga 
ctcacataat 
tccaatgatc 
tcgtgtgttg 
tgaatgtgcc 
tgacgaggct 
aaatcatgga 
tctgagcgat 
ctctttgact 
attgcaagat 
tttgttaagc 
atttgaaaat 
gaatttacca 



<210> 29 
<211> 1356 
<212> DNA 

<213> Saccharomyces cerevisiae 



<400> 29 

atgtcagagt 

gttttagata 

gcccatcctt 

caatttaaag 

tcgataggcg 

tttaaaccta 

gatgatgcct 

agttttcatt 

ttagtcacag 

gtagacaaat 

ggtaaaattg 

agattcccac 

aaactggcgc 

ccttcgggat 

gtccagaagg 

gaactcgatc 

gagactcatg 

tgtca-aaagt 

tttagaaaaa 

ttggatgatt 

gacaaaagat 



tgagagcctt 
caaaatatga 
acggttcatt 
atggggagtg 
gatctaagaa 
acatggacga 
accattctca 
cgcacagaat 
ttttaactac 
atagagaagt 
gaagcgggtt 
ccgcattaat 
atttggttga 
taactttatg 
taaaaaattg 
atgcaaattc 
acgattacag 
atcctgaaat 
taactaaaga 
gccagacctt 
L Lycay LyaL 
tttctaaggt 



cagtgcccca 
agcatttgta 
gcaagggtct 
gctgtaccat 
ccctttcatt 
ctactgcaat 
ggaggatagc 
tgaagaagtt 
agctttggcc 
tattcataat 
tgatgtagcg 
ctctaatttg 
tgaagaagac 
gatgggcgat 
gtatgattcg 
tagatttatg 
cgatcagata 
cacagaagtt 
atctggtgcc 
aaaaggagtt 
tracLaagcaa 
tcaatggctg 



gggaaagcgt 
gtcggattat 
gataagtttg 
ataagtccta 
gaaaaagtta 
agaaacttgt 
gttaccgaac 
cccaaaacag 
tccttttttg 
ttagcacaag 
gcggcagcat 
ccagatattg 
tggaatatta 
attaagaatg 
catatgccag 
gatggactat 
tttgagtctc 
agagatigcag 
gatatcgaac 
cttacttgct 
gatgttgatc 
gatgtaactc 
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tactagctgg 
cggcaagaat 
aagtgcgtgt 
aaagtggctt 
tcgctaacgt 
tcgttattga 
atcgtggcaa 
ggctgggctc 
tatcggacct 
ttgctcattg 
atggatctat 
gaagtgctac 
cgattaaaag 
gttcagaaac 
aaagcttgaa 
ctaaactaga 
ttgagaggaa 
ttgccacaat 
ctcccgtaca 
taatacctgg 
ttagggctca 
aggctgactg 



tggatatt'ta 
gcatgctgta 
gaaaagtaaa 
cattcctgtt 
atttagctac 
tattttctct 
cagaagattg 
ctcggcaggt 
ggaaaataat 
tcaagctcag 
cagatataga 
ttacggcagt 
taaccattta 
agtaaaactg 
aatatataca 
tcgcttacac 
tgactgtacc 
tagacgttcc 
aactagctta 
tgctggtggt 
aaccgctaat 
gggtgttagg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1332 
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300 

360 
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720 
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aaagaaaaag atccggaaac ttatcttgat aaatag 1356 

<210> 30 
<211> 1191 
<212> DNA 

<213> Saccharoiayces cerevisiae • ' 



<400> 30 

atgaccgttt acacagcatc cgttaccgca 
gggaaaaggg acacgaagtt gaatctgccc 
caagatgacc tcagaacgtt gacctctgcg 
ttgtggttaa atggagaacc acacagcatc 
gacctacgcc aattaagaaa ggaaatggaa 
caatggaaac tccacattgt ctccgaaaat 
tccgctgctg gctttgctgc attggtctct 
tcaacttcag aaatatctag aatagcaaga 
tttggcggat acgtggcctg ggaaatggga 
gtacaaatcg cagacagctc tgactggcct 
gatattaaaa aggatgtgag ttccactcag 
ctatttaaag aaagaattga acatgtcgta 
attgttgaaa aagatttcgc cacctttgca 
catgccacat gtttggactc tttccctcca 
atcatcagtt ggtgccacac cattaatcag 
tttgatgcag gtccaaatgc tgtgttgtac 
gcatttatct ataaattgtt tggctctgtt 
cagcttgagg ctttcaacca tcaatttgaa 
cttgagttgc aaaaggatgt tgccagagtg 
gaaacaaacg aatctttgat tgacgcaaag 

<210> 31 
<211> 9253 
<212> DNA 

<213> Artificial Sequence 



cccgtcaaca tcgcaaccct taagtattgg 60 
accaattcgt ccatatcagt gactttatcg 120 
gctactgcac ctgagtttga acgcgacact 180 
gacaatgaaa gaactcaaaa ttgtctgcgc 24 0 
tcgaaggacg cctcattgcc cacattatct 300 
aactttccta cagcagctgg tttagcttcc 360 
gcaattgcta agttatacca attaccacag 420 
aaggggtctg gttcagcttg tagatcgttg 4 80 
aaagctgaag atggtcatga ttccatggca 540 
cagatgaaag cttgtgtcct agttgtcagc 600 
ggtatgcaat tgaccgtggc aacctccgaa 660 
ccaaagagat ttgaagtcat gcgtaaagcc 720 
aaggaaacaa tgatggattc caactctttc 780 
atattctaca tgaatgacac ttccaagcgt 840 
ttttacggag aaacaatcgt tgcatacacg 900 
tacttagctg aaaatgagtc gaaactcttt 960 
cctggatggg acaagaaatt tactactgag 1020 
tcatctaact ttactgcacg tgaattggat 1080 
attttaactc aagtcggttc aggcccacaa 114 0 
actggtctac caaaggaata a 1191 



<220> 

<223> "single operon" nucleotide sequence 



<400> 31 

gacgcttttt atcgcaactc tctactgttt 
ggaattcacc atggtacccg ggaggaggat 
ttgaatgcac agggagttcc' cacgggtacg 
acccgcttac atctcgcgtt ctccagttgg 
acccgccgcg cactgagcaa aaaagcatgg 
cacccacaac tgggagaaag caacgaagac 
ggcgtggaaa ttacgcctcc tgaatctatc 
ccgagtggca ttgtggaaaa tgaagtgtgt 
ttacagatca atgatgatga agtgatggat 
cacggtattg atgccacgcc gtgggcgttc 
cgcgaagcca gaaaacgatt atctgcattt 
gagtcgacta ggaggaatat aaaatgaaaa 
ctatcggtag ttttaacggt tcactcgctt 
taattaaagc cgccattgaa cgtgcaaaaa 
tgggtaacgt gttacaagcc gggctggggc 
gcgggctggc agaaacggtg tgcggattca 
aaagtgtggc gcttgccgcc caggccattc 
ggggtatgga aaatatgagt ttagccccct 
atcgtcttgg agacggacag gtttatgacg 
cccatggtta tcatatgggg attaccgccg 
gtgaaatgca ggatgaactg gcgctacatt 
ccggtgcttt tacagccgaa atcgtcccgg 
tcttcagtca agacgaattc ccgaaagcga 
gcccggcctt cgataaagca ggaacagtca 



ctccataccc gtttttttgg gctagcagga 60 
tactatatgc aaacggaaca cgtcatttta 120 
ctggaaaagt atgccgeaca cacggcagac 180 
ctgtttaatg ccaaaggaca attattagtt 240 
cctggcgtgt ggactaactc ggtttgtggg 300 
gcagtgatcc gccgttgccg ttatgagctt 360 
tatcctgact ttcgctaccg cgccaccgat 420 
ccggtatttg ccgcacgcac cactagtgcg 4 80 
tatcaatggt gtgatttagc agatgtatta 540 
agtccgtgga tggtgatgca ggcgacaaat 600 
acccagctta aataacccgg ggatcctcta 660 
attgtgtcat cgtcagtgcg gtacgtactg 720 
ccaccagcgc catcgacctg ggggcgacag 780 
tcgattcaca acacgttgat gaagtgatta 840 
aaaatccggc gcgtcaggca ctgttaaaaa 900 
cggtcaataa agtatgtggt tcgggtctta 960 
aggcaggtca ggcgcagagc attgtggcgg 1020 
acttactcga tgcaaaagca cgctctggtt 1080 
taatcctgcg cgatggcctg atgtgcgcca 1140 
aaaacgtgfgc traaagagtac ggaattaccc 1200 
cacagcgtaa agcggcagcc gcaattgagt 1260 
taaatgttgt cactcgaaag aaaaccttcg 1320 
attcaacggc tgaagcgtta ggtgcattgc 1380 
ccgctgggaa cgcgtctggt attaacgacg 1440 
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gtgctgccgc tctggtgatt atggaagaat ctgcggcgct ggcagcaggc cttacccccc 1500 
tggctcgcat taaaagttat gccagcggtg gcgtgccccc cgcattgatg ggtatggggc 1560 
cagtacctgc cacgcaaaaa gcgttacaac tggcggggct gcaactggcg gatattgatc 1620 
tcattgaggc taatgaagca tttgctgcac agttccttgc cgttgggaaa aacctgggct 1680 
ttgattctga gaaagtgaat gtcaacggcg gggccatcgc gctcgggcat cctatcggtg 174 0 
ccagtggtgc tcgtattctg gtcacactat tacatgccat gcaggcacgc gataaaacgc 1800* 
tggggctggc aacactgtgc attggcggcg gtcagggaat tgcgatggtg attgaacggt 1860 
tgaattaagg aggacagcta aatgaaactc tcaactaaac tttgttggtg tggtattaaa 1920 
ggaagactta ggccgcaaaa gcaacaacaa ttacacaata caaacttgca aatgactgaa 1980 
ctaaaaaaac aaaagaccgc tgaacaaaaa accagacctc aaaatgtcgg tattaaaggt 2040 
atccaaattt acatcccaac tcaatgtgtc aaccaatctg agctagagaa atttgatggc 2100 
gtttctcaag gtaaatacac aattggtctg ggccaaacca acatgtcttt tgtcaatgac 2160 
agagaagata tctactcgat gtccctaact gttttgtcta agttgatcaa gagttacaac 2220 
atcgacacca acaaaattgg tagattagaa gtcggtactg aaactctgat tgacaagtcc 2280 
aagtctgtca agtctgtctt gatgcaattg tttggtgaaa acactgacgt cgaaggtatt 2340 
gacacgctta atgcctgtta cggtggtacc aacgcgttgt tcaactcttt gaactggatt 24 00 
gaatctaacg catgggatgg tagagacgcc attgtagttt gcggtgatat tgccatctac 24 60 
gataagggtg ccgcaagacc aaccggtggt gccggtactg ttgctatgtg gatcggtcct 2520 
gatgctccaa ttgtatttga ctctgtaaga gcttcttaca tggaacacgc ctacgatttt 2580 
tacaagccag atttcaccag cgaatatcct tacgtcgatg gtcatttttc attaacttgt 2640 
tacgtcaagg ctcttgatca agtttacaag agttattcca agaaggctat ttctaaaggg 2700 
ttggttagcg atcccgctgg ttcggatgct ttgaacgttt tgaaatattt cgactacaac 2760 
gttttccatg ttccaacctg taaattggtc acaaaatcat acggtagatt actatataac 2820 
gatttcagag ccaatcctca attgttccca gaagttgacg ccgaattagc tactcgcgat 2880 
tatgacgaat ctttaaccga taagaacatt gaaaaaactt ttgttaatgt tgctaagcca 294 0 
ttccacaaag agagagttgc ccaatctttg attgttccaa caaacacagg taacatgtac 3000 
accgcatctg tttatgccgc ctttgcatct ctattaaact atgttggatc tgacgactta 3060 
caaggcaagc gtgttggttt attttcttac ggttccggtt tagctgcatc tctatattct 3120 
tgcaaaattg ttggtgacgt ccaacatatt atcaaggaat tagatattac taacaaatta 3180 
gccaagagaa tcaccgaaac tccaaaggat tacgaagctg ccatcgaatt gagagaaaat 3240 
gcccatttga agaagaactt caaacctcaa ggttccattg agcatttgca aagtggtgtt 3300 
tactacttga ccaacatcga tgacaaattt agaagatctt acgatgttaa aaaataagga 3360 
ggattacact atggttttaa ccaataaaac agtcatttct ggatcgaaag tcaaaagttt 3420 
atcatctgcg caatcgagct catcaggacc ttcatcatct agtgaggaag atgattcccg 3480 
cgatattgaa agcttggata agaaaatacg tcctttagaa gaattagaag cattattaag 3540 
tagtggaaat acaaaacaat tgaagaacaa agaggtcgct gccttggtta ttcacggtaa 3600 
gttacctttg tacgctttgg agaaaaaatt aggtgatact acgagagcgg ttgcggtacg 3660 
taggaaggct ctttcaattt tggcagaagc tcctgtatta gcatctgatc gtttaccata 3720 
taaaaattat gactacgacc gcgtatttgg cgcttgttgt gaaaatgtta taggttacat 3780 
gcctttgccc gttggtgtta taggcccctt ggttatcgat ggtacatctt atcatatacc 3840 
aatggcaact acagagggtf gtttggtagc ttctgccatg cgtggctgta* aggcaatcaa 3900 
tgctggcggt ggtgcaacaa ctgttttaac taaggatggt atgacaagag gcccagtagt 3960 
ccgtttccca actttgaaaa gatctggtgc ctgtaagata tggttagact cagaagaggg 4020 
acaaaacgca attaaaaaag cttttaactc tacatcaaga tttgcacgtc tgcaacatat 4080 
tcaaacttgt ctagcaggag atttactctt catgagattt agaacaacta ctggtgacgc 4140 
aatgggtatg aatatgattt ctaaaggtgt cgaatactca ttaaagcaaa tggtagaaga 4200 
gtatggctgg gaagatatgg aggttgtctc cgtttctggt aactactgta ccgacaaaaa 4260 
accagctgcc atcaactgga tcgaaggtcg tggtaagagt gtcgtcgcag aagctactat 4320 
tcctggtgat gttgtcagaa aagtgttaaa aagtgatgtt tccgcattgg ttgagttgaa 4380 
cattgctaag aatttggttg gatctgcaat ggctgggtct gttggtggat ttaacgcaca 4 44 0 
tgcagctaat ttagtgacag ctgttttctt ggcattagga caagatcctg cacaaaatgt 4 500 
tgaaagttcc aactgtataa cattgatgaa agaagtggac ggtgatttga gaatttccgt 4560 
atccatgcca tccatcgaag taggtaccat cggtggtggt actgttctag aaccacaagg 4 620 
tgccatgttg gacttattag gtgtaagagg cccgcatgct accgctcctg gtaccaacgc 4 680 
acgtcaatta gcaagaatag ttgcctgtgc cgtcttggca ggtgaattat ccttatgtgc 4740 
tgccctagca gccggccatt tggttcaaag tcatatgacc cacaacagga aacctgctga 4 800 
accaacaaaa cctaacaatt tggacgccac tgatataaat cgtttgaaag atgggtccgt 4860 
cacctgcatt aaatcctaag tcgacctgca gtaggaggaa ttaaccatgt cattaccgrt 4 920 
cttaacttct gcaccgggaa aggttattat ttttggtgaa cactctgctg tgtacaacaa 4 980 
gcctgccgtc gctgctagtg tgtctgcgtt gagaacctac ctgctaataa gcgagtcatc 504 0 
tgcaccagat actattgaat tggacttccc ggacattagc tttaatcata agtggtccat 5100 
caatgatttc aatgccatca ccgaggatca agtaaactcc caaaaattgg ccaaggctca 5160 
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acaagccacc 
actatccgaa 
atgcccccat 
gttgggctca 
gttaatagga 
atgggccttc 
ggccacttat 
aaacaatttt 
aattccaagg 
tcctgaagtt 
gatcatgact 
tgaactgtat 
cggtgtttct 
ctccacaaaa 
cattactcaa 
gacatttgaa 
taaagatctt 
gcaacaaatt 
ggcagatcaa 
tggatattta 
gcatgctgta 
gaaaagtaaa 
cattcctgtt 
atttagctac 
tattttctct 
cagaagattg 
ctcggcaggt 
ggaaaataat 
tcaagctcag 
cagatataga 
ttacggcagt 
taaccattta 
agtaaaactg 
aatatataca 
tcgcttacac 
tgactgtacc 
tagacgttcc 
aactagctta 
tgctggtggt 
aaccgctaat 
gggtgttagg 
gaccgtttac 
gaaaagggac 
agatgacctc 
gtggttaaat 
cctacgccaa 
atggaaactc 
cgctgctggc 
aacttcagaa 
tggcggatac 
acaaatcgca 
tattaaaaag 
atttaaagaa 
tgttgaaaaa 
tgccacatgt 
catcagttgg 
tgatgcaggt 
atttatctat 
gcttgaggct 
tgagttgcaa 
aacaaacgaa 
caagcttggc 



gatggcttgt 
tccttccact 
gccaagaata 
agcgcctcta 
tctaatgact 
ataggtgaaa 
ggtaatgccc 
aagttcttag 
tctacaaaag 
atgaagccaa 
aagttaagta 
gaacaactat 
catcctggat 
cttaccggtg 
gagcaaattg 
acagacttgg 
aaaatcaaat 
gacgatctat 
atgtcagagt 
gttttagata 
gcccatcctt 
caatttaaag 
tcgataggcg 
tttaaaccta 
gatgatgcct 
agttttcatt 
ttagtcacag 
gtagacaaat 
ggtaaaattg 
agattcccac 
aaactggcgc 
ccttcgggat 
gtccagaagg 
gaactcgatc 
gagactcatg 
tgtcaaaagt 
tttagaaaaa 
ttggatgatt 
tatgacgcca 
gacaaaagat 
aaagaaaaag 
acagcatccg 
acgaagttga 
agaacgttga 
ggagaaccac 
ttaagaaagg 
cacattgtct 
tttgctgcat 
atatctagaa 
gtggcctggg 
gacagctctg 
gatgtgagtt 
agaattgaac 
gatttcgcca 
ttggactctt 
tgccacacca 
ccaaatgctg 
aaattgtttg 
ttcaaccatc 
aaggatgttg 
tctttgattg 
tgttttggcg 



ctcaggaact 
accatgcagc 
ttaagttttc 
tttctgtatc 
tggaaaagct 
agtgtattca 
tgctatttga 
atgatttccc 
atcttgttgc 
ttctagatgc 
aatgtaaagg 
tggaattgat 
tagaacttat 
ctggtggcgg 
acagcttcaa 
gtgggactgg 
ccctagtatt 
tattgccagg 
tgagagcctt 
caaaatatga 
acggttcatt 
atggggagtg 
gatctaagaa 
acatggacga 
accattctca 
cgcacagaat 
ttttaactac 
atagagaagt 
gaagcgggtt 
ccgcattaat 
atttggttga 
taactttatg 
taaaaaattg 
atgcaaattc 
acgattacag 
atcctgaaat 
taactaaaga 
gccagacctt 
ttgcagtgat 
tttctaaggt 
*atccggaa*ac 
ttaccgcacc 
atctgcccac 
cctctgcggc 
acagcatcga 
aaatggaatc 
ccgaaaataa 
tggtctctgc 
tagcaagaaa 
aaatgggaaa 
actggcctca 
ccactcaggg 
atgtcgtacc 
cctttgcaaa 
tccctccaat 
ttaatcagtt 
tgttgtacta 
gctctgttcc 
aatttgaatc 
ccagagtgat 
acgcaaagac 
gatgagagaa 



cgttagtctt 
gttttgtttc 
tttaaagtct 
actggcctta 
gtcagaaaac 
cggtacccct 
aaaagactca 
agccattcca 
tcgcgttcgt 
catgggtgaa 
caccgatgac 
aagaataaat 
taaaaatctg 
cggttgctct 
aaagaaattg 
ctgctgtttg 
ccaattattt 
aaacacgaat 
cagtgcccca 
agcatttgta 
gcaagggtct 
gctgtaccat 
ccctttcatt 
ctactgcaat 
ggaggatagc 
tgaagaagtt 
agctttggcc 
tattcataat 
tgatgtagcg 
ctctaatttg 
tgaagaagac 
gatgggcgat 
gtatgattcg 
tagatttatg 
cgatcagata 
cacagaagtt 
atctggtgcc 
aaaaggagtt 
tactaagcaa 
tcaatggctg 
ttatcttgat 
cgtcaacatc 
caattcgtcc 
tactgcacct 
caatgaaaga 
gaaggacgcc 
ctttcctaca 
aattgctaag 
ggggtctggt 
agctgaagat 
gatgaaagct 
tatgcaattg 
aaagagattt 
ggaaacaatg 
attctacatg 
ttacggagaa 
cttagctgaa 
tggatgggac 
atctaacttt 
tttaactcaa 
tggtctacca 
gattttcagc 

11 



ttggatccgt 

ctgtatatgt 

actttaccca 

gctatggcct 

gataagcata 

tcagga^tag 

cataatggaa 

atgatcctaa 

gtgttggtca 

tgtgccctac 

gaggctgtag 

catggactgc 

agcgatgatt 

ttgactttgt 

caagatgatt 

ttaagcgcaa 

gaaaataaaa 

ttaccatgga 

gggaaagcgt 

gtcggattat 

gataagtttg 

ataagtccta 

gaaaaagtta 

agaaacttgt 

gttaccgaac 

cccaaaacag 

tccttttttg 

ttagcacaag 

gcggcagcat 

ccagatattg 

tggaatatta 

attaagaatg 

catatgccag 

gatggactat 

tttgagtctc 

agagatgcag 

gatatcgaac 

cttacttgct 

gatgttgatc 

gatgtaactc 

aaataggagg ' 

gcaaccctta 

atatcagtga 

gagtttgaac 

actcaaaatt 

tcattgccca 

gcagctggtt 

ttataccaat 

tcagcttgta 

ggtcatgatt 

tgtgtcctag 

accgtggcaa 

gaagtcatgc 

atggattcca 

aatgacactt 

acaatcgttg 

aatgagtcga 

aagaaattta 

actgcacgtg 

gtcggttcag 

aaggaataac 

ctgatacaga 



tgttagctca 
ttgtttgcct 
tcggtgctgg 
acttgggggg 
tagtgaatca 
ataacgctgt 
caataaacac 
cctatactag 
ccgagaaatt 
aaggcttaga 
aaactaataa 
ttgtctcaat 
tgagaattgg 
tacgaagaga 
ttagttacga 
aaaatttgaa 
ctaccacaaa 
cttcatagga 
tactagctgg 
cggcaagaat 
aagtgcgtgt 
aaagtggctt 
tcgctaacgt 
tcgttattga 
atcgtggcaa 
ggctgggctc 
tatcggacct 
ttgctcattg 
atggatctat 
gaagtgctac 
cgattaaaag 
gttcagaaac 
aaagcttgaa 
ctaaactaga 
ttgagaggaa 
ttgccacaat 
ctcccgtaca 
taatacctgg 
ttagggctca 
aggctgactg 
t a at act cat 
agtattgggg 
ctttatcgca 
gcgacacttt 
gtctgcgcga 
cattatctca 
tagcttcctc 
taccacagtc 
gatcgttgtt 
ccatggcagt 
ttgtcagcga 
cctccgaact 
gtaaagccat 
actctttcca 
ccaagcgtat 
catacacgtt 
aactctttgc 
ctactgagca 
aattggatct 
gcccacaaga 
tgcaggcatg 
ttaaatcaga 



5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
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acgcagaagc ggtctgataa aacagaattt gcctggcggc agtagcgcgg tggtcccacc 8940 
tgaccccatg ccgaactcag aagtgaaacg ccgtagcgcc gatggtagtg tggggtctcc 9000 
ccatgcgaga gtagggaact gccaggcatc aaataaaacg aaaggctcag tcgaaagact 9060 
gggcctttcg ttttatctgt tgtttgtcgg tgaacgctct cctgagtagg acaaatccgc 9120 
cgggagcgga tttgaacgtt gcgaagcaac ggcccggagg gtggcgggca ggacgcccgc 9180 
cataaactgc caggcatcaa attaagcaga aggccatcct gacggatggc ctttttgcgt 9240 
ttctacaaac tct 9253 



<210> 32 
<211> 4760 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> "MEVT" operon nucleotide secjuence 



<400> 32 

gacgcttttt 

ggaattcacc 

aattgtgtca 

tccaccagcg 

atcgattcac 

caaaatccgg 

acggtcaata 

caggcaggtc 

tacttactcg 

gtaatcctgc 

gaaaacgtgg 

tcacagcgta 

gtaaatgttg 

aattcaacgg 

accgctggga 

tctgcggcgc 

ggcgtgcccc 

ctggcggggc 

cagttccttg 

ggggccatcg 

ttacatgcca 

ggtcagggaa 

ctcaactaaa 

attacacaat 

aaccagacct 

caaccaatct 

gggccaaacc 

tgttttgtct 

agtcggtact 

gtttggtgaa 

caacgcgttg 

tgccggtact 
agcttcttac 
ttacgtcqat 
gagttattcc 
tttgaacgtt 
cacaaaatca 
agaagttgac 
tgaaaaaact 
gattgttcca 
tctattaaac 
cggttccggt 
tatcaaggaa 
ttacgaagct 



atcgcaactc 
atggtacccg 
tcgtcagtgc 
ccatcgacct 
aacacgttga 
cgcgtcaggc 
aagtatgtgg 
aggcgcagag 
atgcaaaagc 
gcgatggcct 
ctaaagagta 
aagcggcagc 
tcactcgaaa 
ctgaagcgtt 
acgcgt ctgg 
tggcagcagg 
ccgcattgat 
tgcaactggc 
ccgttgggaa 
cgctcgggca 
tgcaggcacg 
ttgcgatggt 
ctttgttggt 
acaaacttgc 
caaaatgtcg 
gagctagaga 
aacatgtctt 
aagttgatca 
gaaactctga 
aacactgacg 
ttcaactctt 
tgcggtgata 
gttgctatgt 
atggaacacg 
ggtcattttt 
aagaaggcta 
ttgaaatatt 
tacggtagat 
gccgaattag 
tttgttaatg 
acaaacacag 
tatgttggat 
ttagctgcat 
ttagatatta 
gccatcgaat 



tctactgttt 

gggatcctct 

ggtacgtact 

gggggcgaca 

tgaagtgatt 

actgttaaaa 

ttcgggtctt 

cattgtggcg 

acgctctggt 

gatgtgcgcc 

cggaattacc 

cgcaattgag 

gaaaaccttc 

aggtgcattg 

tattaacgac 

ccttaccccc 

gggtatgggg 

ggatattgat 

aaacctgggc 

tcctatcggt 

cgataaaacg 

gattgaacgg 

gtggtattaa 

aaatgactga" 

gtattaaagg 

aatttgatgg 

ttgtcaatga 

agagttacaa 

ttgacaagtc 

tcgaaggtat 

tgaactggat 

ttgccatcta 

ggatcggtcc 

cctacgattt 

cattaacttg 

tttctaaagg 

tcgactacaa 

tactatataa 

ctactcgcga 

ttgctaagcc 

gtaacatgta 

ctgacgactt 

ctctatattc 

ctaacaaatt 

tgagagaaaa 



ctccataccc 
agagtcgact 
gctatcggta 
gtaattaaag 
atgggtaacg 
agcgggctgg 
aaaagtgtgg 
gggggtatgg 
tatcgtcttg 
acccatggtt 
cgtgaaatgc 
tccggtgctt 
gtcttcagtc 
cgcccggcct 
ggtgctgccg 
ctggctcgca 
ccagtacctg 
ctcattgagg 
tttgattctg 
gccagtggtg 
ctggggctgg 
ttgaattaag 
aggaagactt 
actaaaaaaa 
tatccaaatt 
cgtttctcaa 
cagagaagat 
catcgacacc 
caagtctgtc 
tgacacgctt 
tgaatctaac 
cgataagggt 
tgatgctcca 
ttacaagcca 
ttacgtcaag 
gttggttagc 
cgttttccat 
cgatttcaga 
ttatgacgaa 
attccacaaa 
caccgcatct 
acaaggcaag 
ttgcaaaatt 
agccaagaga 
tgcccatttg 

12 



gtttttttgg 
aggaggaata 
gttttaacgg 
ccgccattga 
tgttacaagc 
cagaaacggt 
cgcttgccgc 
aaaatatgag 
gagacggaca 
atcatatggg 
aggatgaact 
ttacagccga 
aagacgaatt 
tcgataaagc 
ctctggtgat 
ttaaaagtta 
ccacgcaaaa 
ctaatgaagc 
agaaagtgaa 
ctcgtattct 
caacactgtg 
gaggacagct 
aggccgcaaa 
caaaagaccg 
tacatcccaa 
ggtaaataca 
atctactcga 
aacaaaattg 
aagtctgtct 
aatgcctgtt 
gcatgggatg 
gccgcaagac 
attgtatttg 
gatttcacca 
gctcttgatc 
gatcccgctg 
gttccaacct 
gccaatcctc 
tctttaaccg 
gagagagttg 
gtttatgccg 
cgtgttggtt 
gttggtgacg 
atcaccgaaa 
aagaagaact 



gctagcagga 
taaaatgaaa 
ttcactcgct 
acgtgcaaaa 
cgggctgggg 
gtgcggattc 
ccaggccatt 
tttagccccc 
ggtttatgac 
gattaccgcc 
ggcgctacat 
aatcgtcccg 
cccgaaagcg 
aggaacagtc 
tatggaagaa 
tgccagcggt 
agcgttacaa 
atttgctgca 
tgtcaacggc 
ggtcacacta 
cattggcggc 
aaatgaaact 
agcaacaaca 
ctgaacaaaa 
ctcaatgtgt 
caattggtct 
tgtccctaac 
gtagattaga 
tgatgcaatt 
acggtggtac 
gtagagacgc 
caaccggtgg 
actctgtaag 
gcgaatatcc 
aagtttacaa 
gttcggatgc 
gtaaattggt 
aattgttccc 
ataagaacat 
cccaatcttt 
cctttgcatc 
tattttctta 
tccaacatat 
ctccaaagga 
tcaaacctca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 
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aggttccatt gagcatttgc aaagtggtgt ttactacttg accaacatcg atgacaaatt 2760 
tagaagatct tacgatgtta aaaaataagg aggattacac tatggtttta accaataaaa 2820 
cagtcatttc tggatcgaaa gtcaaaagtt tatcatctgc gcaatcgagc tcatcaggac 2880 
cttcatcatc tagtgaggaa gatgattccc gcgatattga aagcttggat aagaaaatac 2940 
gtcctttaga agaattagaa gcattattaa gtagtggaaa tacaaaacaa ttgaagaaca 3000 
aagaggtcgc tgccttggtt attcacggta agttaccttt gtacgctttg gagaaaaaat 3060 
taggtgatac tacgagagcg gttgcggtac gtaggaaggc tctttcaatt ttggcagaag 3120 
ctcctgtatt agcatctgat cgtttaccat ataaaaatta tgactacgac cgcgtatttg 3180 
gcgcttgttg tgaaaatgtt ataggttaca tgcctttgcc cgttggtgtt ataggcccct 3240 
tggttatcga tggtacatct tatcatatac caatggcaac tacagagggt tgtttggtag 3300 
cttctgccat gcgtggctgt aaggcaatca atgctggcgg tggtgcaaca actgttttaa 3360 
ctaaggatgg tatgacaaga ggcccagtag tccgtttccc aactttgaaa agatctggtg 3420 
cctgtaagat atggttagac tcagaagagg gacaaaacgc aattaaaaaa gcttttaact 3480 
ctacatcaag atttgcacgt ctgcaacata ttcaaacttg tctagcagga gatttactct 3540 
tcatgagatt tagaacaact actggtgacg caatgggtat gaatatgatt tctaaaggtg 3600 
tcgaatactc attaaagcaa atggtagaag agtatggctg ggaagatatg gaggttgtct 3660 
ccgtttctgg taactactgt accgacaaaa aaccagctgc catcaactgg atcgaaggtc 3720 
gtggtaagag tgtcgtcgca gaagctacta ttcctggtga tgttgtcaga aaagtgttaa 3780 
aaagtgatgt ttccgcattg gttgagttga acattgctaa gaatttggtt ggatctgcaa 3840 
tggctgggtc tgttggtgga tttaacgcac atgcagctaa tttagtgaca gctgttttct 3900 
tggcattagg acaagatcct gcacaaaatg ttgaaagttc caactgtata acattgatga 3960 
aagaagtgga cggtgatttg agaatttccg tatccatgcc atccatcgaa gtaggtacca 4020 
tcggtggtgg tactgttcta gaaccacaag gtgccatgtt ggacttatta ggtgtaagag 4080 
gcccgcatgc taccgctcct ggtaccaacg cacgtcaatt agcaagaata gttgcctgtg 4140 
ccgtcttggc aggtgaatta tccttatgtg ctgccctagc agccggccat ttggttcaaa 4200 
gtcatatgac ccacaacagg aaacctgctg aaccaacaaa acctaacaat ttggacgcca 4260 
ctgatataaa tcgtttgaaa gatgggtccg tcacctgcat taaatcctaa gtcgacctgc 4320 
aggcatgcaa gcttggctgt tttggcggat gagagaagat tttcagcctg atacagatta 4380 
aatcagaacg cagaagcggt ctgataaaac agaatttgcc tggcggcagt agcgcggtgg 4 44 0 
tcccacctga ccccatgccg aactcagaag tgaaacgccg tagcgccgat ggtagtgtgg 4500 
ggtctcccca tgcgagagta gggaactgcc aggcatcaaa taaaacgaaa ggctcagtcg 4560 
aaagactggg cctttcgttt tatctgttgt ttgtcggtga acgctctcct gagtaggaca 4 620 
aatccgccgg gagcggattt gaacgttgcg aagcaacggc ccggagggtg gcgggcagga 4 68 0 
cgcccgccat aaactgccag gcatcaaatt aagcagaagg ccatcctgac ggatggcctt 474 0 
tttgcgtttc tacaaactct 4760 

<210> 33 
<211> 4482 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> "MEVB" operon nucleotide sequence 
<400> 33 

gcgcaacgca attaatgtga gttagctcac tcattaggca ccccaggctt tacactttat 60 
gcttccggct cgtatgttgt gtggaattgt gagcggataa caatttcaca caggaaacag 120 
ctatgaccat gattacgcca agcgcgcaat taaccctcac taaagggaac aaaagctggg 180 
taccgggccc cccctcgagg tcgacggtat cgataagctt gatatcgaat tcctgcagta 240 
ggaggaatta accatgtcat taccgttctt aacttctgca ccgggaaagg ttattatttt 300 
tggtgaacac tctgctgtgt acaacaagcc tgccgtcgct gctagtgtgt ctgcgttgag 360 
aacctacctg ctaataagcg agtcatctgc accagatact attgaattgg acttcccgga 420 
cattagcttt aatcataagt ggtccatcaa tgatttcaat gccatcaccg aggatcaagt 480 
aaactcccaa aaattggcca aggctcaaca agccaccgat ggcttgtctc aggaactcgt 540 
tagtcttttg gatccgttgt tagctcaact atccgaatcc ttccactacc atgcagcgtt 600 
ttgtttcctg tatatgtttg tttgcctatg cccccatgcc aagaatatta agttttcttt 660 
aaagtctact ttacccatcg gtgctgggtt gggctcaagc gcctctattt ctgtatcact 720 
ggccttagct atggcctact tgggggggtt aataggatct aatgacttgg aaaagctgtc 780 
agaaaacgat aagcatatag tgaatcaatg ggccttcata ggtgaaaagt gtattcacgg 840 
taccccttca ggaatagata acgctgtggc cacttatggt aatgccctgc tatttgaaaa 900 
agactcacat aatggaacaa taaacacaaa caattttaag t±ct±agatg atttcccagc 960 
cattccaatg atcctaacct atactagaat tccaaggtct acaaaagatc ttgttgctcg 1020 
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cgttcgtgtg ttggtcaccg agaaatttcc tgaagttatg aagccaattc tagatgccat 1080 
gggtgaatgt gccctacaag gcttagagat catgactaag ttaagtaaat gtaaaggcac 1140 
cgatgacgag gctgtagaaa ctaataatga actgtatgaa caactattgg aattgataag 1200 
aataaatcat ggactgcttg tctcaatcgg tgtttctcat cctggattag aacttattaa 1260 
aaatctgagc gatgatttga gaattggctc cacaaaactt accggtgctg gtggcggcgg 1320 
ttgctctttg actttgttac gaagagacat tactcaagag caaattgaca gcttcaaaaa 1380 
gaaattgcaa gatgatttta gttacgagac atttgaaaca gacttgggtg ggactggctg 1440 
ctgtttgtta agcgcaaaaa atttgaataa agatcttaaa atcaaatccc tagtattcca 1500 
attatttgaa aataaaacta ccacaaagca acaaattgac gatctattat tgccaggaaa 1560 
cacgaattta ccatggactt cataggaggc agatcaaatg tcagagttga gagccttcag 1620 
tgccccaggg aaagcgttac tagctggtgg atatttagtt ttagatacaa aatatgaagc 1680 
atttgtagtc ggattatcgg caagaatgca tgctgtagcc catccttacg gttcattgca 174 0 
agggtctgat aagtttgaag tgcgtgtgaa aagtaaacaa tttaaagatg gggagtggct 1800 
gtaccatata agtcctaaaa gtggcttcat tcctgtttcg ataggcggat ctaagaaccc 1860 
tttcattgaa aaagttatcg ctaacgtatt tagctacttt aaacctaaca tggacgacta 1920 
ctgcaataga aacttgttcg ttattgatat tttctctgat gatgcctacc attctcagga 1980 
ggatagcgtt accgaacatc gtggcaacag aagattgagt tttcattcgc acagaattga 2040 
agaagttccc aaaacagggc tgggctcctc ggcaggttta gtcacagttt taactacagc 2100 
tttggcctcc ttttttgtat cggacctgga aaataatgta gacaaatata gagaagttat 2160 
tcataattta gcacaagttg ctcattgtca agctcagggt aaaattggaa gcgggtttga 2220 
tgtagcggcg gcagcatatg gatctatcag atatagaaga ttcccacccg cattaatctc 228 0 
taatttgcca gatattggaa gtgctactta cggcagtaaa ctggcgcatt tggttgatga 234 0 
agaagactgg aatattacga ttaaaagtaa ccatttacct tcgggattaa ctttatggat 2400 
gggcgatatt aagaatggtt cagaaacagt aaaactggtc cagaaggtaa aaaattggta 2460 
tgattcgcat atgccagaaa gcttgaaaat atatacagaa ctcgatcatg caaattctag 2520 
atttatggat ggactatcta aactagatcg cttacacgag actcatgacg attacagcga 2580 
tcagatattt gagtctcttg agaggaatga ctgtacctgt caaaagtatc ctgaaatcac 2640 
agaagttaga gatgcagttg ccacaattag acgttccttt agaaaaataa ctaaagaatc 2700 
tggtgccgat atcgaacctc ccgtacaaac tagcttattg gatgattgcc agaccttaaa 2760 
aggagttctt acttgcttaa tacctggtgc tggtggttat gacgccattg cagtgattac 2820 
taagcaagat gttgatctta gggctcaaac cgctaatgac aaaagatttt ctaaggttca 2880 
atggctggat gtaactcagg ctgactgggg tgttaggaaa gaaaaagatc cggaaactta 294 0 
tcttgataaa taggaggtaa tactcatgac cgtttacaca gcatccgtta ccgcacccgt 3000 
caacatcgca acccttaagt attgggggaa aagggacacg aagttgaatc tgcccaccaa 30 60 
ttcgtccata tcagtgactt tatcgcaaga tgacctcaga acgttgacct ctgcggctac 3120 
tgcacctgag tttgaacgcg acactttgtg gttaaatgga gaaccacaca gcatcgacaa 3180 
tgaaagaact caaaattgtc tgcgcgacct acgccaatta agaaaggaaa tggaatcgaa 3240 
ggacgcctca ttgcccacat tatctcaatg gaaactccac attgtctccg aaaataactt 3300 
tcctacagca gctggtttag cttcctccgc tgctggcttt gctgcattgg tctctgcaat 3360 
tgctaagtta taccaattac cacagtcaac ttcagaaata tctagaatag caagaaaggg 3420 
gtctggttca gcttgtagat cgttgtttgg cggatacgtg gcctgggaaa tgggaaaagc 3480 
tgaagatggt catgattcca tggcagtaca aatcgcagac agctctgact ggcctcagat 3540 
gaaagcttgt gtcctagttg tcagcgatat taaaaaggat gtgagttcca ctcagggtat 3600 
gcaattgacc gtggcaacct ccgaactatt taaagaaaga attgaacatg tcgtaccaaa 3660 
gagatttgaa gtcatgcgta aagccattgt tgaaaaagat ttcgccacct ttgcaaagga 3720 
aacaatgatg gattccaact ctttccatgc cacatgtttg gactctttcc ctccaatatt 3780 
ctacatgaat gacacttcca agcgtatcat cagttggtgc cacaccatta atcagtttta 3840 
cggagaaaca atcgttgcat acacgtttga tgcaggtcca aatgctgtgt tgtactactt 3900 
agctgaaaat gagtcgaaac tctttgcatt tatctataaa ttgtttggct ctgttcctgg 3960 
atgggacaag aaatttacta ctgagcagct tgaggctttc aaccatcaat ttgaatcatc 4020 
taactttact gcacgtgaat tggatcttga gttgcaaaag gatgttgcca gagtgatttt 4080 
aactcaagtc ggttcaggcc cacaagaaac aaacgaatct ttgattgacg caaagactgg 4140 
tctaccaaag gaataactgc agcccggggg atccactagt tctagagcgg ccgccaccgc 4200 
ggtggagctc caattcgccc tatagtgagt cgtattacgc gcgctcactg gccgtcgttt 4260 
tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc 4320 
cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct tcccaacagt 4380 
tgcgcagcct gaatggcgaa tggaaattgt aagcgttaat attttgttaa aattcgcgtt 4440 
aaatttttgt taaatcagct c aLLLLLLa a ccaataggcc ga 4482 



<210> 34 
<211> 549 
<212> DNA 
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<213> Escherichia coli 
<400> 34 

atgcaaacgg aacacgtcat tttattgaat 
aagtatgccg cacacacggc agacacccgc 
aatgccaaag' gacaattatt agttacccgc 
gtgtggacta actcggtttg tgggcaccca 
atccgccgtt gccgttatga gcttggcgtg 
gactttcgct accgcgccac cgatccgagt 
tttgccgcac gcaccactag tgcgttacag 
tggtgtgatt tagcagatgt attacacggt 
tggatggtga tgcaggcgac aaatcgcgaa 
cttaaataa 

<210> 35 
<211> 900 
<212> DNA 

<213> Escherichia coli 
<400> 35 

atggactttc cgcagcaact cgaagcctgc 
tttatcgccc cactgccctt tcagaacact 
ttattaggtg gtaagcgcct gcgacctttc 
gttagcacaa acacgctgga cgcacccgct 
ttaattcatg atgatttacc ggcaatggat 
tgccatgtga agtttggcga agcaaacgcg 
gcgttctcga ttttaagcga tgccgatatg 
atgatttctg aactggcgag cgccagtggt 
gatttagacg cggaaggcaa acacgtacct 
aaaaccggcg cattgattcg cgccgccgtt 
ggacgtcgtg ctctgccggt actcgacaag 
gttcaggatg acatcctgga tgtggtggga 
gccgaccagc aacttggtaa aagtacctac 
aagaaagccc gggatctgat cgacgatgcc 
tcactcgata cctcggcact ggaagcgcta 



gcacagggag ttcccacggg tacgctggaa 60 
ttacatctcg cgttctccag ttggctgttt 120 
cgcgcactga gcaaaaaagc atggcctggc 180 
caactgggag aaagcaacga agacgcagtg 240 
gaaattacgc ctcctgaatc tatctatcct 300 
ggcattgtgg aaaatgaagt gtgtccggta 360 
atcaatgatg atgaagtgat ggattatcaa 4 20 
attgatgcca cgccgtgggc gttcagtccg 480 
gccagaaaac gattatctgc atttacccag 540 

549 



gttaagcagg ccaaccaggc gctgagccgt 60 
cccgtggtcg aaaccatgca gtatggcgca 120 
ctggtttatg ccaccggtca tatgttcggc 180 
gccgccgttg agtgtatcca cgcttactca 240 
gatgacgatc tgcgtcgcgg tttgccaacc 300 
attctcgctg gcgacgcttt acaaacgctg 360 
ccggaagtgt cggaccgcga cagaatttcg 420 
attgccggaa tgtgcggtgg tcaggcatta 480 
ctggacgcgc ttgagcgtat tcatcgtcat 540 
cgccttggtg cattaagcgc cggagataaa 600 
tatgcagaga gcatcggcct tgccttccag 660 
gatactgcaa cgttgggaaa acgccagggt 720 
cctgcacttc tgggtcttga gcaagcccgg 780 
cgtcagtcgc tgaaacaact ggctgaacag 840 
gcggactaca tcatccagcg taataaataa 900 



<210> 36 
<211> 5051 
<212> DNA 

<213> Artificial Sequence" ' 
<220> 

<223> MBI operon 
<400> 36 

gcgcaacgca attaatgtga gttagctcac tcattaggca ccccaggctt tacactttat 60 
gcttccggct cgtatgttgt gtggaattgt gagcggataa caatttcaca caggaaacag 120 
ctatgaccat gattacgcca agcgcgcaat taaccctcac taaagggaac aaaagctggg 180 
taccgggccc cccctcgagg tcgacggtat cgataagctt gatatcgaat tcctgcagta 24 0 
ggaggaatta accatgtcat taccgttctt aacttctgca ccgggaaagg ttattatttt 300 
tggtgaacac tctgctgtgt acaacaagcc tgccgtcgct gctagtgtgt ctgcgttgag 360 
aacctacctg ctaataagcg agtcatctgc accagatact attgaattgg acttcccgga 420 
cattagcttt aatcataagt ggtccatcaa tgatttcaat gccatcaccg aggatcaagt 4 80 
aaactcccaa aaattggcca aggctcaaca agccaccgat ggcttgtctc aggaactcgt 540 
tagtcttttg gatccgttgt tagctcaact atccgaatcc ttccactacc atgcagcgtt 600 
ttgtttcctg tatatgtttg tttgcctatg cccccatgcc aagaatatta agttttcttt 660 
aaagtctact ttacccatcg gtgctgggtt gggctcaagc gcctctattt ctgtatcact 720 
ggccttagct atggcctact tgggggggtt aataggatct aatgacttgg aaaagctgtc 780 
agaaaacgat aagcatatag tgaatcaatg ggccttcata ggtgaaaagt gtattcacgg 84 0 
taccccttca ggaatagata acgctgtggc cacttatggt aatgccctgc tatttgaaaa 900 
agactcacat aatggaacaa taaacacaaa caattttaag ttcttagatg atttcccagc 960 
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cattccaatg atcctaacct atactagaat 
cgttcgtgtg ttggtcaccg agaaatttcc 
gggtgaatgt gccctacaag gcttagagat 
cgatgacgag gctgtagaaa ctaataatga 
aataaatcat ggactgcttg tctcaatcgg 
aaatctgagc gatgatttga gaattggctc 
ttgctctttg actttgttac gaagagacat 
gaaattgcaa gatgatttta gttacgagac 
ctgtttgtta agcgcaaaaa atttgaataa 
attatttgaa aataaaacta ccacaaagca 
cacgaattta ccatggactt cataggaggc 
tgccccaggg aaagcgttac tagctggtgg 
atttgtagtc ggattatcgg caagaatgca 
agggtctgat aagtttgaag tgcgtgtgaa 
gtaccatata agtcctaaaa gtggcttcat 
tttcattgaa aaagttatcg ctaacgtatt 
ctgcaataga aacttgttcg ttattgatat 
ggatagcgtt accgaacatc gtggcaacag 
agaagttccc aaaacagggc tgggctcctc 
tttggcctcc ttttttgtat cggacctgga 
tcataattta gcacaagttg ctcattgtca 
tgtagcggcg gcagcatatg gatctatcag 
taatttgcca gatattggaa gtgctactta 
agaagactgg aatattacga ttaaaagtaa 
gggcgatatt aagaatggtt cagaaacagt 
tgattcgcat atgccagaaa gcttgaaaat 
atttatggat ggactatcta aactagatcg 
tcagatattt gagtctcttg agaggaatga 
agaagttaga gatgcagttg ccacaattag 
tggtgccgat atcgaacctc ccgtacaaac 
aggagttctt acttgcttaa tacctggtgc 
taagcaagat gttgatctta gggctcaaac 
atggctggat gtaactcagg ctgactgggg 
tcttgataaa taggaggtaa tactcatgac 
caacatcgca acccttaagt attgggggaa 
ttcgtccata tcagtgactt tatcgcaaga 
tgcacctgag tttgaacgcg acactttgtg 
tgaaagaact caaaattgtc tgcgcgacct 
ggacgcctca ttgcccacat tatctcaatg 
tcctacagca gctggtttag cttcctccgc 
tgctaagtta taccaattac cacagtcaac 
gtctggttca gcttgtagat cgttgtttgg 
tgaagatggt catgattcca tggcagtaca 
gaaagcttgt gtcctagttg tcagcgatat 
gcaattgacc gtggcaacct ccgaactatt 
gagatttgaa gtcatgcgta aagccattgt 
aacaatgatg gattccaact ctttccatgc 
ctacatgaat gacacttcca agcgtatcat 
cggagaaaca atcgttgcat acacgtttga 
agctgaaaat gagtcgaaac tctttgcatt 
atgggacaag aaatttacta ctgagcagct 
taactttact gcacgtgaat tggatcttga 
aactcaagtc ggttcaggcc cacaagaaac 
tctaccaaag gaataactgc agcccgggag 
attttattga atgcacaggg agttcccacg 
gcagacaccc gcttacatct cgcgttctcc 
ttagttaccc gccgcgcact gagcaaaaaa 
tgtgggcacc cacaactggg agaaagcaac 
gagcttggcg tggaaattac gcctcctgaa 
accgatccga gtggcattgt ggaaaatgaa 
agtgcgttac agatcaatga tgatgaagtg 
gtattacacg gtattgatgc cacgccgtgg 



tccaaggtct acaaaagatc ttgttgctcg 1020 
tgaagttatg aagccaattc tagatgccat 1080 
catgactaag ttaagtaaat gtaaaggcac 1140 
actgtatgaa caactattgg aattgataag 1200 
tgtttctcat cctggattag aacttattaa 1260 
cacaaaactt accggtgctg gtggcggcgg 1320 
tactcaagag caaattgaca gcttcaaaaa 1380 
atttgaaaca gacttgggtg ggactggctg 14 4 0 
agatcttaaa atcaaatccc tagtattcca 1500 
acaaattgac gatctattat tgccaggaaa 1560 
agatcaaatg tcagagttga gagccttcag 1620 
atatttagtt ttagatacaa aatatgaagc 1680 
tgctgtagcc catccttacg gttcattgca 1740 
aagtaaacaa tttaaagatg gggagtggct 1800 
tcctgtttcg ataggcggat ctaagaaccc 18 60 
tagctacttt aaacctaaca tggacgacta 1920 
tttctctgat gatgcctacc attctcagga 1980 
aagattgagt tttcattcgc acagaattga 2040 
ggcaggttta gtcacagttt taactacagc 2100 
aaataatgta gacaaatata gagaagttat 2160 
agctcagggt aaaattggaa gcgggtttga 2220 
atatagaaga ttcccacccg cattaatctc 2280 
cggcagtaaa ctggcgcatt tggttgatga 2340 
ccatttacct tcgggattaa ctttatggat 2400 
aaaactggtc cagaaggtaa aaaattggta 24 60 
atatacagaa ctcgatcatg caaattctag 2520 
cttacacgag actcatgacg attacagcga 2580 
ctgtacctgt caaaagtatc ctgaaatcac 2640 
acgttccttt agaaaaataa ctaaagaatc 2700 
tagcttattg gatgattgcc agaccttaaa 2760 
tggtggttat gacgccattg cagtgattac 2820 
cgctaatgac aaaagatttt ctaaggttca 2880 
tgttaggaaa gaaaaagatc cggaaactta 2940 
cgtttacaca gcatccgtta ccgcacccgt 3000 
aagggacacg aagttgaatc tgcccaccaa 3060 
tgacctcaga acgttgacct ctgcggctac 3120 
gttaaatgga gaaccacaca gcatcgacaa 3180 
acgccaatta agaaaggaaa tggaatcgaa 324 0 
gaaactccac attgtctccg aaaataactt 3300 
tgctggcttt gctgcattgg tctctgcaat 3360 
ttcagaaata tctagaatag caagaaaggg 3420 
cggatacgtg gcctgggaaa tgggaaaagc 3480 
aatcgcagac agctctgact ggcctcagat 3540 
taaaaaggat gtgagttcca ctcagggtat 3600 
taaagaaaga attgaacatg tcgtaccaaa 3660 
tgaaaaagat ttcgccacct ttgcaaagga 3720 
cacatgtttg gactctttcc ctccaatatt 3780 
cagttggtgc cacaccatta atcagtttta 384 0 
tgcaggtcca aatgctgtgt tgtactactt 3900 
tatctataaa ttgtttggct ctgttcctgg 3960 
tgaggctttc aaccatcaat ttgaatcatc 4020 
gttgcaaaag gatgttgcca gagtgatttt 4080 
aaacgaatct ttgattgacg caaagactgg 4140 
gaggattact atatgcaaac ggaacacgtc 4200 
ggtacgctgg aaaagtatgc cgcacacacg 4260 
agttggctgt ttaatgccaa aggacaatta 4320 
gcatggcctg gcgtgtggac taactcggtt 4380 
gaagacgcag tgatccgccg ttgccgttat 4440 
tctatctatc ctgactttcg ctaccgcgcc 4500 
gtgtgtccgg tatttgccgc acgcaccact 4560 
atggattatc aatggtgtga tttagcagat 4 620 
gcgttcagtc cgtggatggt gatgcaggcg 4 680 
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acaaatcgcg aagccagaaa acgattatct 
tccactagtt ctagagcggc cgccaccgcg 
gtattacgcg cgctcactgg ccgtcgtttt 
tacccaactt aatcgccttg cagcacatcc 
ggcccgcacc gatcgccctt cccaacagtt 
agcgttaata ttttgttaaa attcgcgtta 
caataggccg a 

<210> 37 
<211> 5963 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> MBIS operon 
<400> 37 

gcgcaacgca attaatgtga gttagctcac 
gcttccggct cgtatgttgt gtggaattgt 
ctatgaccat gattacgcca agcgcgcaat 
taccgggccc cccctcgagg tcgacggtat 
ggaggaatta accatgtcat taccgttctt 
tggtgaacac tctgctgtgt acaacaagcc 
aacctacctg ctaataagcg agtcatctgc 
cattagcttt aatcataagt ggtccatcaa 
aaactcccaa aaattggcca aggctcaaca 
tagtcttttg gatccgttgt tagctcaact 
ttgtttcctg tatatgtttg tttgcctatg 
aaagtctact ttacccatcg gtgctgggtt 
ggccttagct atggcctact tgggggggtt 
agaaaacgat aagcatatag tgaatcaatg 
taccccttca ggaatagata acgctgtggc 
agactcacat aatggaacaa taaacacaaa 
cattccaatg atcctaacct atactagaat 
cgttcgtgtg ttggtcaccg agaaatttcc 
gggtgaatgt gccctacaag gcttagagat 
cgatgacgag gctgtagaaa ctaataatga 
aataaatcat ggactgcttg tctcaatcgg 
aaatctgagc gatgatttga gaattggctc 
ttgctctttg actttgttac gaagagacat 
' gaaattgcaa gatgatttta gttac'gagac 
ctgtttgtta agcgcaaaaa atttgaataa 
attatttgaa aataaaacta ccacaaagca 
cacgaattta ccatggactt cataggaggc 
tgccccaggg aaagcgttac tagctggtgg 
atttgtagtc ggattatcgg caagaatgca 
agggtctgat aagtttgaag tgcgtgtgaa 
gtaccatata agtcctaaaa gtggcttcat 
tttcattgaa aaagttatcg ctaacgtatt 
ctgcaataga aacttgttcg ttattgatat 
ggatagcgtt accgaacatc gtggcaacag 
agaagttccc aaaacagggc tgggctcctc 
tttggcctcc ttttttgtat cggacctgga 
tcataattta gcacaagttg ctcattgtca 
tgtagcggcg gcagcatatg gatctatcag 
taatttgcca gatattggaa gtgctactta 
agaagactgg aatattacga ttaaaagtaa 
gggcgatatt aagaatggtt cagaaacagt 
tgattcgcat atgccagaaa gcttgaaaat 
atttatggat ggactatcta aactagatcg 
tcagatattt gagtctcttg agaggaatga 
agaagttaga gatgcagttg ccacaattag 
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gcatttaccc agcttaaata acccggggga 4740 
gtggagctcc aattcgccct atagtgagtc 4800 
acaacgtcgt gactgggaaa accctggcgt 4 860 
ccctttcgcc agctggcgta atagcgaaga 4 920 
gcgcagcctg aatggcgaat ggaaattgta 4 980 
aatttttgtt aaatcagctc attttttaac 5040 

5051 



tcattaggca ccccaggctt tacactttat 60 
gagcggataa caatttcaca caggaaacag 120 
taaccctcac taaagggaac aaaagctggg 180 
cgataagctt gatatcgaat tcctgcagta 240 
aacttctgca ccgggadagg ttattatttt 300 
tgccgtcgct gctagtgtgt ctgcgttgag 360 
accagatact attgaattgg acttcccgga 420 
tgatttcaat gccatcaccg aggatcaagt 480 
agccaccgat ggcttgtctc aggaactcgt 54 0 
atccgaatcc ttccactacc atgcagcgtt 600 
cccccatgcc aagaatatta agttttcttt 660 
gggctcaagc gcctctattt ctgtatcact 720 
aataggatct aatgacttgg aaaagctgtc 780 
ggccttcata ggtgaaaagt gtattcacgg 84 0 
cacttatggt aatgccctgc tatttgaaaa 900 
caattttaag ttcttagatg atttcccagc 960 
tccaaggtct acaaaagatc ttgttgctcg 1020 
tgaagttatg aagccaattc tagatgccat 1080 
catgactaag ttaagtaaat gtaaaggcac 1140 
actgtatgaa caactattgg aattgataag 1200 
tgtttctcat cctggattag aacttattaa 12 60 
cacaaaactt accggtgctg gtggcggcgg 1320 
tactcaagag caaattgaca gcttcaaaaa 1380 
atttgaaaca gacttgggtg ' ggactggctg 1440 
agatcttaaa atcaaatccc tagtattcca 1500 
acaaattgac gatctattat tgccaggaaa 1560 
agatcaaatg tcagagttga gagccttcag 1620 
atatttagtt ttagatacaa aatatgaagc 1680 
tgctgtagcc catcct£acg gttcattgca 1740 
aagtaaacaa tttaaagatg gggagtggct 1800 
tcctgtttcg ataggcggat ctaagaaccc 1860 
tagctacttt aaacctaaca tggacgacta 1920 
tttctctgat gatgcctacc attctcagga 1980 
aagattgagt tttcattcgc acagaattga 2040 
ggcaggttta gtcacagttt taactacagc 2100 
aaataatgta gacaaatata gagaagttat 2160 
agctcagggt aaaattggaa gcgggtttga 2220 
atatagaaga ttcccacccg cattaatctc 2280 
cggcagtaaa ctggcgcatt tggttgatga 234 0 
ccatttacct tcgggattaa ctttatggat 2400 
aaaactggtc cagaaggtaa aaaattggta 24 60 
atatacagaa ctcgatcatg caaattctag 2520 
cttacacgag actcatgacg attacagcga 2580 
ctgtacctgt caaaagtatc ctgaaatcac 264 0 
acgttccttt agaaaaataa ctaaagaatc 2700 
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tggtgccgat atcgaacctc ccgtacaaac tagcttattg gatgattgcc agaccttaaa 2760 
aggagttctt acttgcttaa tacctggtgc tggtggttat gacgccattg cagtgattac 2820 
taagcaagat gttgatctta gggctcaaac cgctaatgac aaaagatttt ctaaggttca 2880 
atggctggat gtaactcagg ctgactgggg tgttaggaaa gaaaaagatc cggaaactta 2940 
tcttgataaa taggaggtaa tactcatgac cgtttacaca gcatccgtta ccgcacccgt 3000 
caacatcgca acccttaagt attgggggaa aagggacacg aagttgaatc tgcccaccaa 3060 
ttcgtccata tcagtgactt tatcgcaaga tgacctcaga acgttgacct ctgcggctac 3120 
tgcacctgag tttgaacgcg acactttgtg gttaaatgga gaaccacaca gcatcgacaa 3180 
tgaaagaact caaaattgtc tgcgcgacct acgccaatta agaaaggaaa tggaatcgaa 324 0 
ggacgcctca ttgcccacat tatctcaatg gaaactccac attgtctccg aaaataactt 3300 
tcctacagca gctggtttag cttcctccgc tgctggcttt gctgcattgg tctctgcaat 3360 
tgctaagtta taccaattac cacagtcaac ttcagaaata tctagaatag caagaaaggg 3420 
gtctggttca gcttgtagat cgttgtttgg cggatacgtg gcctgggaaa tgggaaaagc 3480 
tgaagatggt catgattcca tggcagtaca aatcgcagac agctctgact ggcctcagat 354 0 
gaaagcttgt gtcctagttg tcagcgatat taaaaaggat gtgagttcca ctcagggtat 3600 
gcaattgacc gtggcaacct ccgaactatt taaagaaaga attgaacatg tcgtaccaaa 3660 
gagatttgaa gtcatgcgta aagccattgt tgaaaaagat ttcgccacct ttgcaaagga 3720 
aacaatgatg gattccaact ctttccatgc cacatgtttg gactctttcc ctccaatatt 3780 
ctacatgaat gacacttcca agcgtatcat cagttggtgc cacaccatta atcagtttta 3840 
cggagaaaca atcgttgcat acacgtttga tgcaggtcca aatgctgtgt tgtactactt 3900 
agctgaaaat gagtcgaaac tctttgcatt tatctataaa ttgtttggct ctgttcctgg 3960 
atgggacaag aaatttacta ctgagcagct tgaggctttc aaccatcaat ttgaatcatc 4020 
taactttact gcacgtgaat tggatcttga gttgcaaaag gatgttgcca gagtgatttt 4080 
aactcaagtc ggttcaggcc cacaagaaac aaacgaatct ttgattgacg caaagactgg 4140 
tctaccaaag gaataactgc agcccgggag gaggattact atatgcaaac ggaacacgtc 4200 
attttattga atgcacaggg agttcccacg ggtacgctgg aaaagtatgc cgcacacacg 4260 
gcagacaccc gcttacatct cgcgttctcc agttggctgt ttaatgccaa aggacaatta 4320 
ttagttaccc gccgcgcact gagcaaaaaa gcatggcctg gcgtgtggac taactcggtt 4380 
tgtgggcacc cacaactggg agaaagcaac gaagacgcag tgatccgccg ttgccgttat 4440 
gagcttggcg tggaaattac gcctcctgaa tctatctatc ctgactttcg ctaccgcgcc 4500 
accgatccga gtggcattgt ggaaaatgaa gtgtgtccgg tatttgccgc acgcaccact 4560 
agtgcgttac agatcaatga tgatgaagtg atggattatc aatggtgtga .tttagcagat 4 620 
gtattacacg gtattgatgc cacgccgtgg gcgttcagtc cgtggatggt gatgcaggcg 4 680 
acaaatcgcg aagccagaaa acgattatct gcatttaccc agcttaaata acccggggga 4740 
tccactagtt ctagagcggc cgccaccgcg gaggaggaat gagtaatgga ctttccgcag 4800 
caactcgaag cctgcgttaa gcaggccaac caggcgctga gccgttttat cgccccactg 4860 
ccctttcaga acactcccgt ggtcgaaacc atgcagtatg gcgcattatt aggtggtaag 4 920 
cgcctgcgac ctttcctggt ttatgccacc ggtcatatgt tcggcgttag cacaaacacg 4 98 0 
ctggacgcac ccgctgccgc cgttgagtgt atccacgctt actcattaat tcatgatgat 504 0 
ttaccggcaa tggatgatga cgatctgcgt cgcggtttgc caacctgcca tgtgaagttt 5100 
ggcgaagcaa acgcgattct cgctggcgac gctttacaaa cgctggcgtt ctcgatttta 5160 
agcgatgccg atatgccgga agtgtcggac cgcgacagaa tttcgatgat ttctgaactg 5220 
gcgagcgcca gtggtattgc cggaatgtgc ggtggtcagg cattagattt agacgcggaa 5280 
ggcaaacacg tacctctgga cgcgcttgag cgtattcatc gtcataaaac cggcgcattg 534 0 
attcgcgccg ccgttcgcct tggtgcatta agcgccggag ataaaggacg tcgtgctctg 5400 
ccggtactcg acaagtatgc agagagcatc ggccttgcct tccaggttca ggatgacatc 5460 
ctggatgtgg tgggagatac tgcaacgttg ggaaaacgcc agggtgccga ccagcaactt 5520 
ggtaaaagta cctaccctgc acttctgggt cttgagcaag cccggaagaa agcccgggat 5580 
ctgatcgacg atgcccgtca gtcgctgaaa caactggctg aacagtcact cgatacctcg 5640 
gcactggaag cgctagcgga ctacatcatc cagcgtaata aataagagct ccaattcgcc 5700 
ctatagtgag tcgtattacg cgcgctcact ggccgtcgtt ttacaacgtc gtgactggga 5760 
aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg ccagctggcg 5820 
taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga 5880 
atggaaattg taagcgttaa tattttgtta aaattcgcgt taaatttttg ttaaatcagc 594 0 
tcatttttta accaataggc cga 5963 
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